Objects as Attributes for Scene Classification
نویسندگان
چکیده
Robust low-level image features have proven to be effective representations for a variety of high-level visual recognition tasks, such as object recognition and scene classification. But as the visual recognition tasks become more challenging, the semantic gap between low-level feature representation and the meaning of the scenes increases. In this paper, we propose to use objects as attributes of scenes for scene classification. We represent images by collecting their responses to a large number of object detectors, or “object filters”. Such representation carries high-level semantic information rather than low-level image feature information, making it more suitable for high-level visual recognition tasks. Using very simple, off-the-shelf classifiers such as SVM, we show that this object-level image representation can be used effectively for high-level visual tasks such as scene classification. Our results are superior to reported state-of-the-art performance on a number of standard datasets.
منابع مشابه
3D Scene and Object Classification Based on Information Complexity of Depth Data
In this paper the problem of 3D scene and object classification from depth data is addressed. In contrast to high-dimensional feature-based representation, the depth data is described in a low dimensional space. In order to remedy the curse of dimensionality problem, the depth data is described by a sparse model over a learned dictionary. Exploiting the algorithmic information theory, a new def...
متن کاملA Generalized Scene Graph
Scene graphs are fundamental data structures for hierarchical scene modeling. The generalized scene graph overcomes various limitations of current scene graph architectures such as support for different 3D rendering systems, integration of multi-pass rendering, and declarative modeling of scenes. The main idea is to separate scene specification from scene evaluation. To specify scenes, scene gr...
متن کاملHierarchical Space Tiling (HST) for scene classification and attribute localization
In this paper, we propose a representation named Hierarchical Space Tiling (HST) and a weakly supervised method for simultaneously leaning the scene configurations and attributes from a collection of natural images associated with attributes in text, where the precise localization of each attributes left unknown. We start with an over-complete HST which quantizes the huge and continuous scene c...
متن کاملModified CLPSO-based fuzzy classification System: Color Image Segmentation
Fuzzy segmentation is an effective way of segmenting out objects in images containing both random noise and varying illumination. In this paper, a modified method based on the Comprehensive Learning Particle Swarm Optimization (CLPSO) is proposed for pixel classification in HSI color space by selecting a fuzzy classification system with minimum number of fuzzy rules and minimum number of incorr...
متن کاملGenerating Triples with Adversarial Networks for Scene Graph Construction
Driven by successes in deep learning, computer vision research has begun to move beyond object detection and image classification to more sophisticated tasks like image captioning or visual question answering. Motivating such endeavors is the desire for models to capture not only objects present in an image, but more fine-grained aspects of a scene such as relationships between objects and thei...
متن کامل